METHOD AND SYSTEM TO HANDLE LARGE VOLUME OF E-MAIL
RECEIVED FROM A PLURALITY OF SENDERS INTELLIGENTLY

Field of the invention: 

This invention relates to a method and system for handling a large volume of
e-mail received from a plurality of senders and intelligently generating
suitable responses.

Background of the invention: 

With an increase in e-mail usage there is a need to add powerful features to
e-mail tools. E-mail usage, both personal and official, is likely to increase at
a phenomenal rate. With increasing mail volumes, users will feel the need for
more powerful e-mail tools. Some of the problems that users are likely to face
in the near future are:

1. Handling an enormous amount of mail.

2. Retaining quality of mail response for all the mails. Typically it has been
observed that

a. Mail response to mails read at the end of a day is poor in quality of
content.

b. Mail response to mails after the first 50-odd mails decreases
steadily in quality.

c. Human fatigue and urgency during office work also take their toll,
and users sometimes handle mail arbitrarily, not giving the right
attention at the right place.

This leads to a lot of problems in professional and personal scenarios.
The impact could be far-reaching, especially for people in very influential and
senior positions (technical or management), where in a professional environment
it could also lead to huge financial losses.

So far no serious study has been done on the impact of arbitrary treatment of 
electronic mail on the productivity, effectiveness and balance sheets of 
companies and the solution that would alleviate some of its effects. 

With the surge in dot-com companies and a present 160 million global users,
e-mail usage is likely to assume gargantuan proportions, and it is likely that
in the future companies would appoint e-mail screeners to screen and prioritize
mail. It is estimated that 500 million users would be connected to the net by
2003. Add to this the growing intranet and extranet usage, which is also likely
to increase with e-business. Presently, members of senior management in large
organizations who handle high-volume e-mail already have their secretaries
help them handle mail.

In fact, 90% of the time spent by a manager in any industry goes into
communicating (including meetings, telephone calls and mail). In the coming
years there is going to be a major shift towards mail usage, especially in
non-IT industries, where the emphasis of communication is going to shift
strategically to electronic mail.

With burgeoning e-mail quantity, there is a need for a special focus on the
content of e-mail. E-mail usage is likely to become monotonous, ubiquitous and,
last but not least, extremely time consuming due to large volume.
Consequently, a great deal of conscious effort needs to be put into maintaining
the quality of e-mail content, especially in a business scenario. Arbitrary
e-mail usage in an e-business scenario could have catastrophic effects. On the
other hand, high-quality e-mail content with richness and relevance is likely to
have a very positive impact on an e-business.

E-mail, being a human activity so far, is riddled with human problems such as
fatigue, lack of concentration and lack of time. There is a dire need for
e-mail tools that can alleviate the problems described above.

Let us consider the existing scenario in a well connected company. 

While receiving a large number of e-mails:

1. The number of e-mails could be sufficiently large that key individuals 
may not have the time to browse through the same and generate replies 
for each of them. 

2. While replying to a plurality of senders the user seldom remembers the 
significant contents of the mails sent by these senders over a period of 
time while composing the reply. Whatever little the user recollects is 
limited by his/her memory of the said detail. The absence of this takes 
away the richness and relevance of contents. Sometimes irrelevant 
content inclusion by oversight or poor memory leads to further needless 
mail exchange apart from bandwidth expense and other image/goodwill/ 
business damages. Precious time is anyway lost in the process. 

Presently mail handling is done by: 

a. Reading every single mail and replying to mails separately. This can
become really cumbersome, tiring and time consuming, especially if the
number of related mails received is of the order of hundreds or thousands.
The quality of replies also decreases as the number of mails increases. A
modest estimate of the amount of time spent by an individual on mails is
discussed below.

b. As far as richness of content is concerned there is no systematic method 
used to lookup the relevant information from previous mails. This leads to 
understatements, misrepresentations, approximations, misunderstandings 
and sometimes leads to needless mail exchanges. In a business scenario 
this also leads to potential business loss. 

An E-mail usage survey: 

An e-mail usage survey was conducted on 20 members of a junior technical
group on a normal business day in IBM Global Services India (P) Ltd.
The findings were as follows.

Assumptions: 

1. Lotus Notes, Netscape mail, Unix mail and all other kinds of mail were
included.

2. One-liners are one-line messages per mail.

3. Short messages would contain 2-10 lines per mail.

4. Medium length messages would contain 10-100 lines per mail.

5. Long messages would contain 100-500 lines per mail, including
attachments.

Estimated time spent in seconds, by type of mail. The step columns are:

A = Choose & point to msg, read subject & sender, prioritize
B = Open the msg
C = Read the msg
D = Understand the msg
E = Re-read msg
F = Frame reply
G = Review reply

Type of mail     A    B    C    D    E    F    G    Time spent      Time spent      Time spent
                                                    with reply      without reply   on new msg
One liner        1    1    1    1    0    2    0    6/60 mins       4/60 mins       2/60 mins
Short message    1    1    10   5    5    10   5    37/60 mins      22/60 mins      15/60 mins
Medium message   1    1    120  60   20   120  60   6 mins 22 secs  3 mins 22 secs  3 mins
Long message     1    1    480  300  60   480  300  27 mins 2 secs  14 mins 2 secs  13 mins

Measure                                                                  Mails
Avg. no. of mails received in a day (include official and personal)      14
Avg. no. of new messages sent in a day                                   5
Avg. no. of replies to messages sent in a day                            5
Avg. no. of one liners received per day                                  2
Avg. no. of short messages received per day                              6
Avg. no. of medium length messages received per day                      4
Avg. no. of long messages received per day (including attachments)       1


As can be seen from the above, only 5 of the 14 mails received needed a reply,
which is 36% of all mails received.

The number of new mails sent was also approximately 36% of the total mails
received.

Approximately 64% of all mails received were read but not replied to.
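
The totals in the survey table follow directly from the per-step times. The sketch below is illustrative only: the step values are copied from the table, while the function names and the reading that "time spent on new msg" equals the frame and review steps are assumptions inferred from the figures.

```python
# Step times in seconds, copied from the survey table.
# Order: choose/prioritize, open, read, understand, re-read, frame reply, review reply.
STEP_SECONDS = {
    "one-liner": (1, 1, 1, 1, 0, 2, 0),
    "short": (1, 1, 10, 5, 5, 10, 5),
    "medium": (1, 1, 120, 60, 20, 120, 60),
    "long": (1, 1, 480, 300, 60, 480, 300),
}


def totals(steps):
    """Return (with_reply, without_reply, new_message) totals in seconds."""
    reading = sum(steps[:5])   # the five reading steps
    writing = sum(steps[5:])   # framing and reviewing the reply
    return reading + writing, reading, writing


def as_min_sec(seconds):
    """Split a number of seconds into (minutes, seconds)."""
    return divmod(seconds, 60)


for kind, steps in STEP_SECONDS.items():
    with_reply, without_reply, new_msg = totals(steps)
    print(kind, as_min_sec(with_reply), as_min_sec(without_reply), as_min_sec(new_msg))

# Share of received mails that needed a reply: 5 of 14, rounded to a percentage.
reply_share = round(5 / 14 * 100)
```

For the long-message row this gives 27 minutes 2 seconds with a reply, 14 minutes 2 seconds without, and 13 minutes for a new message, matching the table.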
Assuming the % distribution shown above we get:

Mail type         Time taken
One liners        0.36 * 6/60 + 0.36 * 4/60 + 0.64 * 2/60 = 4.88/60 minutes
Short messages    0.36 * 37/60 + 0.36 * 22/60 + 0.64 * 15/60 = 30.84/60 minutes
Medium messages   0.36 * (6 + 22/60) + 0.36 * (3 + 22/60) + 0.64 * 3 = 5.42 minutes
Long messages     0.36 * (27 + 2/60) + 0.36 * (14 + 2/60) + 0.64 * 13 = 23 minutes
Total             28 minutes 35 seconds = 28 minutes (approx.)



This is a very modest estimate. 



Managers spend 90% of their time communicating (e-mail, telephone and 
meetings) and therefore the time spent by them on e-mail is much more than 
what is seen above. Senior management spends much more time in handling 
mail. 



Extrapolating the above figure of 28 minutes for every 14 mails we get the 
following data. 



No. of mails received     Total time spent (including reading
in a business day         and replying to selected mails)
10                        20 minutes
50                        1 hour 40 minutes
100                       3 hours 20 minutes
500                       16 hours 40 minutes
1000                      33 hours 20 minutes
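
The extrapolation above is a simple proportion: 28 minutes per 14 mails, or 2 minutes per mail. A minimal sketch that reproduces the table, with illustrative constant and function names:

```python
MINUTES_PER_BATCH = 28   # rounded total from the estimate above
MAILS_PER_BATCH = 14     # average mails received per day in the survey


def time_for(mails):
    """Return (hours, minutes) spent on `mails` messages at the surveyed rate."""
    total_minutes = mails * MINUTES_PER_BATCH // MAILS_PER_BATCH
    return divmod(total_minutes, 60)


for n in (10, 50, 100, 500, 1000):
    hours, minutes = time_for(n)
    print(f"{n:>5} mails: {hours} h {minutes} min")
```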



8572p-195.doc 



Some recent attempts at solving these problems are described in US patent no. 
5,948,058 and Japanese patent laid-open publication (Kokai) nos. Heisei 6- 
162085, Heisei 2-170642 and Heisei 4-351134. However, all these patents are 
limited in the scope of their solutions, as none of these utilize the power of 
available technology in the form of expert systems. Furthermore, none of these 
patents addresses the issue of generating replies to the received emails 
automatically. 

The object of this invention is to provide a method and a system for handling 
large amount of mail efficiently, effectively and intelligently including 
automatic generation of responses using an expert system. 

To achieve the said objective this invention provides in a computing system a 
method to handle large volume of e-mail received from a plurality of senders 
intelligently, by automatically processing each email based on a pre-determined 
classification system and stored information, said method comprising the steps 
of: 

receiving and sending the electronic mails, 

parsing the electronic mail header to capture keywords for the 
purpose of identifying the sender, the subject and specific key 
words and/or phrases, 

parsing the electronic mail body including attachments if any, for 
keywords and/or phrases for purpose of categorizing the e-mail for 
response, 

storing the said received emails in a personalized email database 
(PED), 

analyzing the emails stored in the PED for identifying co-relations
among received e-mails using an expert system (ES) with machine
learning capabilities to assist the user in analyzing and preparing
replies,

preparing a reply template using a reply template generator (RTG),

storing the email replies in said PED,

configuring said PED and said ES using a personalized email
database configurator (PEC) for updating.

The above method further includes: 

storing of the received and sent e-mails in a mailbox (MB) within 
said PED, 

storing the result of the analysis by said expert system in New 
Knowledge Base (NKB) in the said PED. 

The above method further includes storage of personal data profile of the user, 
calendar of appointments / meetings, current job contents in said PED. 

The above method further includes the accessing of said PED over a network so 
as to make it useful to a travelling user. 

The above method further includes the accessing of said PED through 
appropriate facilities including palm pilots. 

The above method further includes: 

optionally generating the reply template, 

selecting mail type on which to generate reply template e.g.
one-liner, short, medium or long replies,

enabling/disabling history search and intelligent reply template
generation for specific types of mail, such as short mails,






enabling/disabling history search and intelligent reply template 
generation for specific type of mails for cc'ed type or bcc'ed type 
or mails sent to newsgroups, 

specifying history search and intelligent reply template generation
parameters like:

• whether to search on subject and/or sender,

• time period in which the messages need to be searched for,

• type of message contents to be included/excluded,

scheduling deletion of mails from the MB and NKB,

scheduling sending of mails,

specifying latest first or oldest first while generating relevant
intelligent reply,

specifying limits on inclusion of older reply contents - time period
wise, volume wise and bandwidth wise,

specifying criteria for inclusion/exclusion of keywords,

providing access to multiple PEDs at various locations over the
network,

providing on-the-fly exclusion / inclusion of original mail and
reply contents including the various levels of replies and
counter-replies

by the user through said PEC.
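
The configurable options listed above could be held in a single settings object. The following is a sketch under stated assumptions: the class name, field names and default values are all hypothetical and are not specified by the invention.

```python
from dataclasses import dataclass, field
from datetime import timedelta


@dataclass
class ReplyTemplateConfig:
    """Hypothetical container for the user-configurable PEC options."""
    generate_template: bool = True
    mail_types: set = field(
        default_factory=lambda: {"one-liner", "short", "medium", "long"})
    history_for_short_mails: bool = True
    history_for_cc_bcc_newsgroup: bool = False
    search_on_subject: bool = True
    search_on_sender: bool = True
    search_window: timedelta = timedelta(days=90)   # assumed default period
    latest_first: bool = True
    include_keywords: set = field(default_factory=set)
    exclude_keywords: set = field(default_factory=set)

    def wants_history(self, mail_type, is_copy):
        """Decide whether history search applies to a mail of this kind."""
        if not self.generate_template or mail_type not in self.mail_types:
            return False
        if is_copy:   # cc'ed, bcc'ed or newsgroup mail
            return self.history_for_cc_bcc_newsgroup
        if mail_type == "short":
            return self.history_for_short_mails
        return True
```

A user who only wants templates for long mails would construct `ReplyTemplateConfig(mail_types={"long"})`, after which `wants_history("short", False)` is false.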

The above method further includes displaying said reply template on the screen 
by said RTG based on searches conducted within the NKB in said PED. 

The above method further includes displaying of:
the mail received - R1,
reply sent to R1 - S1,
reply received on S1 - R2,
reply sent to R2 - S2,
reply received on S2 - R3 ....
by said RTG, serially and in chronological sequence, individually or in
groups of R1, R2, R3 or S1, S2, S3 or in any combination requested by
the user, either in configurable colors and/or with changed font type and
size.
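
The R1, S1, R2, S2 ... ordering described above can be sketched as a sort on timestamps followed by per-direction numbering. The sample thread and the function names are invented for illustration.

```python
from datetime import datetime

# (timestamp, direction) pairs; "R" = received, "S" = sent. Invented sample data.
thread = [
    (datetime(2000, 1, 3, 9, 0), "R"),
    (datetime(2000, 1, 3, 11, 30), "S"),
    (datetime(2000, 1, 4, 8, 15), "R"),
    (datetime(2000, 1, 4, 16, 45), "S"),
    (datetime(2000, 1, 5, 10, 0), "R"),
]


def label_thread(messages):
    """Sort by time and label messages R1, S1, R2, S2, ... per direction."""
    counters = {"R": 0, "S": 0}
    labels = []
    for _, direction in sorted(messages):
        counters[direction] += 1
        labels.append(f"{direction}{counters[direction]}")
    return labels


def select(labels, group):
    """Pick a display group: 'R' (received), 'S' (sent) or 'all'."""
    return [label for label in labels if group == "all" or label.startswith(group)]
```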

The above method further includes viewing and searching of the database by 
said RTG for relevant emails/messages with: 

the same subject, 

the same sender and same subject 

the same subject and any one of the recipients listed in the cc: list 
or the To: list and various other similar combinations. 

The said reply template is in the same format in which said attachments have 
been received. 

The above method further includes generation of co-relations and new
associations by said ES using state-of-the-art and state-of-the-practice
techniques of NLP, AI and machine learning.

The above method further includes searching said PED by said ES for
co-relations amongst e-mails received

sender wise,
senders within a particular timeframe,
thread wise or subject wise,
sender and subject wise,
sender, subject and date wise,
sender, keyword wise.
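
One simple reading of these searches is grouping stored mails by a chosen key and treating any group with more than one mail as co-related. The sketch below assumes that reading; the record layout, sample data and function names are hypothetical.

```python
from collections import defaultdict
from datetime import date

# Invented sample records: (sender, subject, date received, keywords).
MAILS = [
    ("asha@example.com", "Budget", date(2000, 3, 1), {"invoice"}),
    ("asha@example.com", "Budget", date(2000, 3, 9), {"invoice", "approval"}),
    ("ravi@example.com", "Budget", date(2000, 3, 2), {"invoice"}),
    ("asha@example.com", "Travel", date(2000, 4, 1), {"visa"}),
]


def correlate(mails, key):
    """Group mails by a key function; groups of two or more are co-related."""
    groups = defaultdict(list)
    for mail in mails:
        groups[key(mail)].append(mail)
    return {k: v for k, v in groups.items() if len(v) > 1}


def correlate_sender_keyword(mails):
    """Sender and keyword wise: one group per (sender, keyword) pair."""
    groups = defaultdict(list)
    for mail in mails:
        for keyword in mail[3]:
            groups[(mail[0], keyword)].append(mail)
    return {k: v for k, v in groups.items() if len(v) > 1}


by_sender = correlate(MAILS, lambda m: m[0])
by_subject = correlate(MAILS, lambda m: m[1])
by_sender_subject = correlate(MAILS, lambda m: (m[0], m[1]))
```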
In a computing system, a system to handle large volume of e-mail received from
a plurality of senders intelligently, by automatically processing each email
based on a pre-determined classification system and stored information,
comprising:

means for receiving and sending the electronic mails, 
means for parsing the electronic mail header to capture keywords 
for the purpose of identifying the sender, the subject and specific 
key words and/or phrases, 

means for parsing the electronic mail body including attachments if 
any, for keywords and/or phrases for purpose of categorizing the e- 
mail for response, 

means for storing the said received emails in a personalized email 
database (PED), 

means for analyzing the emails stored in the PED for identifying 
co-relations among received e-mails using an expert system (ES) 
with machine learning capabilities to assist the user in analyzing 
and preparing replies, 

means for preparing a reply template using a reply template 
generator (RTG), 

means for storing the email replies in said PED, 

means for configuring said PED and said ES using a personalized
email database configurator (PEC) for updating.

The above system further includes: 

means for storing the received and sent e-mails in a mailbox (MB) 
within said PED, 

means for storing the result of the analysis by said expert system in 
New Knowledge Base (NKB) in the said PED. 
The above system further includes means for storing personal data profile of the
user, calendar of appointments / meetings, current job contents in said PED.

The above system further includes the means for accessing said PED over a 
network so as to make it useful to a travelling user. 

The above system further includes the means for accessing said PED through 
appropriate facilities including palm pilots. 

The above system further includes means for allowing the user through said 
PEC to: 

optionally generate the reply template, 

select mail type on which to generate reply template e.g. one-liner,
short, medium or long replies,

enable/disable history search and intelligent reply template
generation for specific types of mail, such as short mails,

enable/disable history search and intelligent reply template
generation for specific type of mails for cc'ed type or bcc'ed type
or mails sent to newsgroups,

specify history search and intelligent reply template generation
parameters like:

• whether to search on subject and/or sender,

• time period in which the messages need to be searched for,

• type of message contents to be included/excluded,

schedule deletion of mails from the MB and NKB,

schedule sending of mails,

specify latest first or oldest first while generating relevant
intelligent reply,

specify limits on inclusion of older reply contents - time period
wise, volume wise and bandwidth wise,

specify criteria for inclusion/exclusion of keywords,

provide access to multiple PEDs at various locations over the
network,

provide on-the-fly exclusion / inclusion of original mail and reply
contents including the various levels of replies and counter-replies.

The above system further includes means for displaying said reply template on
the screen by said RTG based on searches conducted within the NKB in said
PED.

The above system further includes means for displaying:

the mail received - R1,

reply sent to R1 - S1,

reply received on S1 - R2,

reply sent to R2 - S2,

reply received on S2 - R3 ....
by said RTG, serially and in chronological sequence, individually or in
groups of R1, R2, R3 or S1, S2, S3 or in any combination requested by
the user, either in configurable colors and/or with changed font type and
size.

The above system further includes means for viewing and searching of the 
database by said RTG for relevant emails/messages with: 
the same subject, 

the same sender and same subject, 

the same subject and any one of the recipients listed in the cc: list 
or the To: list and various other similar combinations. 
The above system further includes means for generating co-relations and new
associations by said ES using state-of-the-art and state-of-the-practice
techniques of NLP, AI and machine learning.

The above system further includes means for searching said PED by said ES for
co-relations amongst e-mails received

sender wise,
senders within a particular timeframe,
thread wise or subject wise,
sender and subject wise,
sender, subject and date wise,
sender, keyword wise.

A computer program product comprising computer readable program code 
stored on computer readable storage medium embodied therein for causing a 
computer to handle large volume of e-mail received from a plurality of senders 
intelligently, said computer program code comprising: 

computer readable program code means configured for receiving
and sending the electronic mails,

computer readable program code means configured for parsing the
electronic mail header to capture keywords for the purpose of
identifying the sender, the subject and specific key words and/or
phrases,

computer readable program code means configured for parsing the
electronic mail body including attachments if any, for keywords
and/or phrases for purpose of categorizing the e-mail for response,

computer readable program code means configured for storing the
said received emails in a personalized email database (PED),

computer readable program code means configured for analyzing
the emails stored in the PED for identifying co-relations among
received e-mails using an expert system (ES) with machine
learning capabilities to assist the user in analyzing and preparing
replies,

computer readable program code means configured for preparing a
reply template using a reply template generator (RTG),

computer readable program code means configured for storing the
email replies in said PED,

computer readable program code means for configuring said PED
and said ES using a personalized email database configurator
(PEC) for updating.

The above computer program product further includes: 

computer readable program code means configured for storing of 
the received and sent e-mails in a mailbox (MB) within said PED, 
computer readable program code means configured for storing the 
result of the analysis by said expert system in New Knowledge 
Base (NKB) in the said PED. 

The above computer program product further includes computer readable 
program code means configured for storage of personal data profile of the user, 
calendar of appointments/ meetings, current job contents in said PED. 

The above computer program product further includes computer readable 
program code means configured for accessing said PED over a network so as to 
make it useful to a travelling user. 
The above computer program product further includes computer readable
program code means configured for accessing said PED through appropriate
facilities including palm pilots.



The above computer program product further includes computer readable
program code means configured for allowing the user through said PEC to:

optionally generate the reply template,

select mail type on which to generate reply template e.g. one-liner,
short, medium or long replies,

enable/disable history search and intelligent reply template
generation for specific types of mail, such as short mails,

enable/disable history search and intelligent reply template
generation for specific type of mails for cc'ed type or bcc'ed type
or mails sent to newsgroups,

specify history search and intelligent reply template generation
parameters like:

• whether to search on subject and/or sender,

• time period in which the messages need to be searched for,

• type of message contents to be included/excluded,

schedule deletion of mails from the MB and NKB,

schedule sending of mails,

specify latest first or oldest first while generating relevant
intelligent reply,

specify limits on inclusion of older reply contents - time period
wise, volume wise and bandwidth wise,

specify criteria for inclusion/exclusion of keywords,

provide access to multiple PEDs at various locations over the
network,

provide on-the-fly exclusion / inclusion of original mail and reply
contents including the various levels of replies and counter-replies.

The above computer program product further includes computer readable 
program code means configured for displaying said reply template on the screen 
by said RTG based on searches conducted within the NKB in said PED. 

The above computer program product further includes computer readable
program code means configured for displaying of:

the mail received - R1,

reply sent to R1 - S1,

reply received on S1 - R2,

reply sent to R2 - S2,

reply received on S2 - R3 ....
by said RTG, serially and in chronological sequence, individually or in
groups of R1, R2, R3 or S1, S2, S3 or in any combination requested by
the user, either in configurable colors and/or with changed font type and
size.

The above computer program product further includes computer readable 
program code means configured for viewing and searching of the database by 
said RTG for relevant emails/messages with: 

the same subject, 

the same sender and same subject 

the same subject and any one of the recipients listed in the cc: list 
or the To: list and various other similar combinations. 

The above computer program product further includes computer readable
program code means configured for generating co-relations and new
associations by said ES using state-of-the-art and state-of-the-practice
techniques of NLP, AI and machine learning.

The above computer program product further includes computer readable 
program code means configured for searching the said PED by said ES for co- 
relations amongst e-mails received 
sender wise, 

senders within a particular timeframe, 
thread wise or subject wise, 
sender and subject wise, 
sender, subject and date wise, 
sender, keyword wise. 

The invention will now be described with reference to the accompanying 
drawings: 

Fig. 1 shows the entity diagram of the system, according to this invention. 

Fig. 2 shows the flow diagram describing the operation of the system, according 
to this invention. 

Fig. 3 shows the flow diagram describing the operation of the expert system. 

Fig. 4 shows the flow diagram of the personalized email database configurator 
(PEC). 

As shown in figure 1, the email header parser (1) parses the received email
header to extract information as defined by the user. The email message is then
parsed by the email body parser (2) to extract user-defined keywords and/or
phrases. The received email is then stored in the mailbox (MB) (4) located in
the personalized email database (PED) (3).

The PED contains two logical parts, the mailbox (MB) and the New Knowledge
Base (NKB). The MB contains the mail archives, that is, all sent and received
mail, whereas the NKB contains assertions derived from the mail archives by the
expert system (ES) (9).

The reply template generator (RTG) (6) retrieves the received email from the
mailbox (MB) (4) and generates the output reply template on the monitor (7) of
the system in accordance with user-defined rules. The personalized email
database configurator (PEC) (8) enables the user to configure the PED (3), the
RTG (6) and the ES (9). The ES operates on the PED (3) using machine learning
algorithms in order to identify co-relations between different email messages
and generate the new knowledge base (NKB) (5), which is used by the RTG (6) in
formulating the output reply.

The invention also provides a facility wherein the PED is accessible over a
network, so that it can be accessed by a travelling user. Appropriate facilities
are also provided to support wireless access to the PED using pervasive
computing equipment such as palm pilots. With the help of converters, the PED
(the personalized e-mail archive or database) supports Netscape Inbox format,
Notes nsf format, Claris format, Eudora format, Microsoft Outlook Express
format, IE file format, Unix mbox format, Qualcomm file format, etc.
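
The two logical parts of the PED can be pictured as two collections held by one object. This is a minimal in-memory sketch, not the storage design of the invention; the class names, fields and methods are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Mail:
    sender: str
    subject: str
    body: str
    sent_by_user: bool = False   # True for mail the user sent


@dataclass
class Assertion:
    user: str
    subject: str
    keyword: str
    text: str


@dataclass
class PED:
    """Personalized email database: mailbox (MB) plus New Knowledge Base (NKB)."""
    mailbox: list = field(default_factory=list)   # all sent and received mail
    nkb: list = field(default_factory=list)       # assertions produced by the ES

    def store(self, mail):
        self.mailbox.append(mail)

    def add_assertion(self, assertion):
        self.nkb.append(assertion)

    def assertions_for(self, sender=None, subject=None):
        """Return NKB entries matching the given sender and/or subject."""
        return [a for a in self.nkb
                if (sender is None or a.user == sender)
                and (subject is None or a.subject == subject)]
```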

Referring to figure 2, the email receiving means receives the email (11). The
email header is then parsed (12) by a text parser (1) to extract information
about sender, subject, domain and address for reply, including CC, Bcc and
newsgroup information, as defined by configuration data. The email is then
further parsed for the body contents (13) by text parser (2) to identify
whether the email needs to be replied to immediately, later, or not at all.
Further, the parser searches for user-defined keywords and/or phrases/clauses.
If no reply is to be generated, the email along with the parsed information is
archived (21) in the PED (3). If a reply is required, the reply template
generator (RTG) is invoked to generate a reply template (15), which includes:

generation of appropriate salutation and end-of-message signature
details, as configured,

inclusion of original email contents as configured,

searching of mailbox (MB) and new knowledge base (NKB) in
personalized email database (PED) based on configured parameters
including

• message type (one-liner, short, medium, long)

• subject, sender, recipient, keyword, thread, or combination
thereof,

to extract information for inclusion in the reply.
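
The template steps listed above (salutation, quoted original, history lookup, signature) can be sketched as a plain string assembly. The dictionary keys, default salutation and signature, and the function name are hypothetical; a real RTG would draw them from the configured PED.

```python
def build_reply_template(mail, history, salutation="Dear {name},",
                         signature="Regards,\nA. User", quote_original=True):
    """Assemble a reply draft: salutation, quoted original, history, signature."""
    name = mail["sender"].split("@")[0]
    parts = [salutation.format(name=name), ""]
    if quote_original:
        parts += ["> " + line for line in mail["body"].splitlines()]
        parts.append("")
    # History search on sender and subject, one of the configurable combinations.
    related = [h for h in history
               if h["sender"] == mail["sender"] and h["subject"] == mail["subject"]]
    if related:
        parts.append("Earlier points on this subject:")
        parts += ["- " + h["note"] for h in related]
        parts.append("")
    parts.append(signature)
    return "\n".join(parts)
```

The user would then edit the returned draft before it is archived and sent.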

If the reply is required immediately, the generated template is displayed to the 
user on the monitor of the system in the configured format (18). The user 
completes the reply (20) and the system archives the reply in the PED (21) for 
subsequent transmission. If however, the email is to be replied-to later, the 
generated reply template is stored (17) in the PED and the PED is set up to 
generate an alarm or reminder for the user at the appropriate time (19). 
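
The parsing and triage steps of figure 2 can be sketched with Python's standard `email` package. The keyword lists and the three outcome labels are assumptions chosen for illustration; the invention leaves them to user configuration.

```python
from email import message_from_string
from email.utils import getaddresses, parseaddr

# Invented sample message; a blank line separates header and body.
RAW = "\n".join([
    "From: Asha Rao <asha@example.com>",
    "To: user@example.com",
    "Cc: ravi@example.com, team@example.com",
    "Subject: Budget approval needed",
    "",
    "Please reply urgently with the approved figure.",
])

URGENT = {"urgent", "asap"}             # hypothetical user-defined keywords
NO_REPLY = {"fyi", "no reply needed"}   # hypothetical user-defined phrases


def parse_header(raw):
    """Extract sender, subject, cc list and body from a raw message."""
    msg = message_from_string(raw)
    return {
        "sender": parseaddr(msg.get("From", ""))[1],
        "subject": msg.get("Subject", ""),
        "cc": [addr for _, addr in getaddresses(msg.get_all("Cc", []))],
        "body": msg.get_payload(),
    }


def triage(body):
    """Classify a body as needing an 'immediate' reply, a 'later' reply or 'none'."""
    text = body.lower()
    if any(phrase in text for phrase in NO_REPLY):
        return "none"
    if any(word in text for word in URGENT):
        return "immediate"
    return "later"
```

Substring matching is deliberately crude here; it stands in for the configurable keyword and phrase search described above.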

Figure 3 describes the functioning of the expert system, which operates on the 
PED to generate new knowledge base (NKB). The expert system searches (22) 
the PED for co-relations among messages based on configured parameters. It 
then generates new knowledge representations using machine learning 
techniques. 
Machine Learning - Using the ES and the PED.

A few cases are explained to illustrate the nature and scope of machine learning 
in an e-mail scenario. 

Case -1: First level knowledge generation 

Consider the case when an e-mail is received from a user u_1, with subject s_1,
possessing a significant keyword k_1. Then an assertion a_1 can be extracted
from this e-mail by the expert system and placed in the NKB.

An assertion is a terse, condensed restatement of the sentence containing the
keyword.

In reality assertions could represent opinions of users, or situations involving
users, or statements of fact.

The ES would use NLP techniques (symbolic logic, predicate calculus etc.) to
convert the English string into an assertion that is updated into the NKB. The
RTG accesses the NKB while generating a reply template.

Let's say that:
User : u_1
Subject : s_1
Keyword : k_1

For each set of (u_1, s_1, k_1) at least one assertion a_1 can be generated.

The ES can be configured to generate more than one assertion per set of
(u_1, s_1, k_1).

Now, if this assertion a_1 happens to involve either u_1 or s_1 or a combination
of u_1 and s_1, this knowledge could be used whenever there is a query on a
keyword involving (u_1, s_2) or (u_2, s_1).

Now a_1 could be suggested as the new assertion while framing a reply template
to any of the users u_1 or u_2.

This can be used to refer to the NKB for the most useful and relevant assertion,
which can be solicited by the user by using a hot key from the RTG.

The machine learning engine of the ES updates the NKB with the knowledge
representations like (u_1, s_1, k_1, a_1), (u_2, s_1, k_1, a_2) etc.

Thus as the number of e-mails increases, the NKB is updated with new
information by the machine learning engine which comes with new associations
of user, subject, keyword and assertion combinations.
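
First-level knowledge generation can be sketched as producing one (user, subject, keyword, assertion) record per keyword hit. The sketch below simply keeps the matching sentence as the assertion; this is a stand-in for the NLP conversion, which the description does not detail, and the function names are hypothetical.

```python
import re


def first_level_assertions(user, subject, body, keywords):
    """Return (user, subject, keyword, assertion) records for each keyword hit."""
    records = []
    sentences = re.split(r"(?<=[.!?])\s+", body.strip())
    for sentence in sentences:
        for keyword in keywords:
            if keyword.lower() in sentence.lower():
                # Stand-in for the NLP step: normalize whitespace, drop end mark.
                assertion = " ".join(sentence.split()).rstrip(".!?")
                records.append((user, subject, keyword, assertion))
    return records


def suggest(nkb, user=None, subject=None):
    """Assertions involving the given user or subject, as the RTG might request."""
    return [a for (u, s, k, a) in nkb if u == user or s == subject]
```

With this layout, an assertion recorded for (u_1, s_1) is also offered when a later query involves (u_2, s_1), as the text describes.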

Case -2 : Second level knowledge generation 

Since the NKB has records with associations of the type (u, s, k, a), various
assertions could be generated on a particular set, and further unique assertions
could be derived from these assertions.

Assume that a_1 to a_n are the assertions derived from various sets of user,
subject, keyword etc.

Let A be the set of all assertions generated using the first level knowledge
generation technique shown above on U, S and K, where

U - Set of all users

S - Set of all subjects

K - Set of associated keywords.

Then:

A = {a_1, a_2, a_3, ..., a_n} would be the set of all assertions generated at
the first level.

Newer assertions can be generated using the above associations; for example,
a_12 can be generated using a_1 and a_2 if appropriate,
a_123 can be generated using a_1, a_2 and a_3.

Similarly,

a_123...k can be generated using assertions a_1 to a_k.

Case -3 : Generating higher levels of knowledge

The assertions generated from the second level of knowledge generation can be
further used to generate assertions at a higher level of knowledge using the
same technique described above.

Other issues: 

The above mentioned cases are examples of implementation at the conceptual 
level. The cases mentioned above implement "learning" by updating the NKB 
with new knowledge representations in the form of assertions that can be 
derived from (U,S,K) combinations and their sub-components. 

The question then arises as to the extent to which these knowledge
representations should be added to the NKB.

The extent to which knowledge representations are derived will have to be
set with care and caution; otherwise a situation could be reached where the ES
repeats knowledge generation ad infinitum.

A situation could also be reached wherein the ES starts generating totally 
irrelevant knowledge. 

Standard rules of thumb can be used initially by fixing an upper limit on 
assertion generation at each level of knowledge. 
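
As an illustration of such a rule of thumb, the sketch below caps both the number of assertions per level and the number of levels, so generation always terminates. The names generate_levels, max_per_level and max_levels are hypothetical, and merging two assertions when they share a keyword is an assumption made only to keep the example small.

```python
from itertools import combinations


def generate_levels(first_level, max_per_level=100, max_levels=3):
    """Derive higher-level assertions with a hard cap at every level.

    `first_level` maps labels to keyword sets. Generation stops when a
    level produces nothing new or when `max_levels` is reached, so the
    loop cannot run ad infinitum.
    """
    levels = [dict(first_level)]
    while len(levels) < max_levels:
        current = levels[-1]
        derived = {}
        for left, right in combinations(sorted(current), 2):
            if len(derived) >= max_per_level:
                break  # upper limit for this level reached
            shared = current[left] & current[right]
            if shared:
                derived[f"({left}+{right})"] = shared
        if not derived:
            break
        levels.append(derived)
    return levels
```

The two limits could later be tuned from observed trends, as the following paragraph suggests, without changing the structure of the loop.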

Gradually this can be improved/changed by observing the trend and historical
evidence. The ES can also be used to convert all of the MB entries into NKB
entries. This would ensure that only the right and relevant information is stored
and would later pave the way for e-mail content standardization. An auxiliary
benefit of such an approach would be reduced disk space for e-mail mailboxes.

Provisions can also be made to change the strategy periodically/continually 
depending on identifiable parameters, which are part of the ES and NKB. 

Review of advanced mail features of existing e-mail tools in comparison with
the expert system used in this invention:

Here is a brief review of leading e-mail clients.

Claris Emailer 2.0 Version 3 - Provides filtering facilities, hierarchical
mailboxes and a window that lists the results of searches. Does not allow
users to send styled text with HTML mail.

QuickMail Pro 1.5.2 - Provides good filtering and mailbox management
features, but does not provide sufficient HTML support.

Microsoft Outlook Express 4.0 - Provides good filtering facilities, hierarchical
mailboxes and HTML support, and conforms to the Open Internet standard.

Netscape Messenger 4.041 - Conforms to the Open Internet standard apart from
providing most of the above mentioned facilities.

Eudora Light 3.13 - Offers hierarchical mailboxes and filters. Also provides
for adding dockable windows, the ability to create filters quickly based on the
current message and multiple nicknames.

As can be seen from the above, none of them offers the kind of facilities and
features discussed in this invention.

Figure 4 defines the operation of the personalized email database configurator 
(PEC). The PEC is used by the user to configure the RTG (25) for any one or 
more of the following options: 

1. Automatic/manual reply template generation

2. Choose mail type on which to generate reply template: e.g. one-liner, short,
medium, etc.

3. Generate reply template including the original mail received. This shall 
include appropriate salutation and end of message signature details. The 
Reply Template Generator (RTG) interfaces with the PED, the ES and the 
monitor. The salutation and signature details are configurable. 

4. Search sender-wise, thread-wise and keyword-wise automatically, either
while generating a reply template or while composing a new mail. There
are function keys (hot keys) in the reply template generator that can be
configured to realise this.

5. Search the PED object-wise. Namely, the invention is equipped to identify
various file formats and to display a miniaturised version of
the object in a frame for easy association, identification and possible
inclusion in a reply message or new mail message.

6. Provide frames on the screen where more information on searches made 
and hits found can be obtained. 

7. Provide additional frames to search and view contents from the NKB. 

8. Include information seen/searched from the frames into the reply by the 
RTG. 

9. View a short form of the message, optionally without the embedded
replies. This is only to facilitate easier viewing of one's own message,
whether it is new or a reply to a received mail. The invention also
provides a facility to view by prompting

- Mail received R1

- Reply sent to R1 - S1

- Reply received on S1 - R2

- Reply sent to R2 - S2

- Reply received on S2 - R3

etc. The above can be viewed serially in chronological sequence or
viewed individually or viewed in groups of R1, R2, R3 or S1, S2, S3 etc. or
in any combination as requested by the user. They can be reviewed either
in different configurable colours and/or with changed font type and size.

10. View messages sender wise on the basis of keywords, date, sender/e-mail 
domain and subject by searching and extracting information from the 
PED. 

11. View consolidated short form messages sender wise on the basis of 
keywords, date, sender/e-mail domain, objects, attachments and subject. 

12. View consolidated long form messages sender wise on the basis of
keywords, date, sender/e-mail domain and subject.

13. View consolidated full length form messages sender wise on the basis of
keywords, date, sender/e-mail domain and subject.

14. Invoke a feature wherein, for messages of any one or more of the various
e-mail types (one-line, short, medium, long and attachments), the tool
searches the database for relevant messages with

• the same subject 

• the same sender and same subject 

• the same subject and any one of the recipients listed in the cc: list or 
the To: list. 

and various other similar combinations. 
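
The search combinations in item 14 could be sketched as below. This is a hedged illustration only: the Mail record, the related function and the mode names are hypothetical, stripping "Re:"/"Fw:" prefixes before comparing subjects is an assumption, and "any one of the recipients" is interpreted here as any stored mail that involves one of the current mail's To: or cc: addresses.

```python
from dataclasses import dataclass, field


@dataclass
class Mail:
    sender: str
    subject: str
    to: list = field(default_factory=list)
    cc: list = field(default_factory=list)


def normalise(subject):
    """Strip leading reply/forward prefixes so 'Re: X' matches 'X'."""
    text = subject.strip()
    while text.lower().startswith(("re:", "fw:", "fwd:")):
        text = text.split(":", 1)[1].strip()
    return text.lower()


def related(mail, database, mode="subject"):
    """Return stored mails related to `mail` under one of three modes."""
    key = normalise(mail.subject)
    people = set(mail.to) | set(mail.cc)
    hits = []
    for old in database:
        if normalise(old.subject) != key:
            continue  # every mode requires the same subject
        involved = set(old.to) | set(old.cc) | {old.sender}
        if mode == "subject":
            hits.append(old)
        elif mode == "sender+subject" and old.sender == mail.sender:
            hits.append(old)
        elif mode == "subject+recipient" and people & involved:
            hits.append(old)
    return hits
```

Other combinations (for example keyword and date) would follow the same pattern of narrowing a subject match with one further condition.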

The PEC is further used by the user for configuring the operation of the expert 
system (ES) in terms of defining keywords and associations and the levels of 
generation of assertions to operate on. The ES may also be configured for 
converting all or some of the mailbox (MB) entries into the new knowledge 
base (NKB). The ES can be configured to generate assertions user-wise with 
parameters some of which are listed here for the sake of illustration: 

• using ascending or descending chronology of mail for assertion 
generation. 

• using user-wise, key-word-wise, subject-wise limits on assertion 
generation at each level of knowledge. 

• using assertion-wise limits on new assertion generation at higher levels of 
knowledge. Care should be taken to ensure that the ES does not 
stretch/exceed available software and hardware resources. 
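
A configuration of this kind might be held in a small settings record, sketched below. Every field name and default value is a hypothetical choice made for illustration; the specification names the kinds of limits but not their representation.

```python
from dataclasses import dataclass


@dataclass
class ESConfig:
    """Hypothetical PEC settings for the expert system (ES)."""
    chronology: str = "ascending"   # or "descending"
    per_user_limit: int = 50        # assertions per user at each level
    per_keyword_limit: int = 20     # assertions per keyword at each level
    per_subject_limit: int = 20     # assertions per subject at each level
    per_assertion_limit: int = 5    # new assertions derived from one assertion
    max_level: int = 3              # highest level of knowledge to generate

    def validate(self):
        """Reject settings that could exhaust resources or make no sense."""
        if self.chronology not in ("ascending", "descending"):
            raise ValueError("chronology must be 'ascending' or 'descending'")
        limits = (self.per_user_limit, self.per_keyword_limit,
                  self.per_subject_limit, self.per_assertion_limit,
                  self.max_level)
        if any(limit < 1 for limit in limits):
            raise ValueError("all limits must be at least 1")
        return self


def order_mails(mails, config):
    """Sort (timestamp, mail_id) pairs in the configured chronology."""
    return sorted(mails, reverse=(config.chronology == "descending"))
```

Keeping the limits explicit in one place makes it straightforward to check them against available software and hardware resources before the ES runs.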

The PEC is also used to configure the personalized email database (PED) for
archiving of mails in the mailbox (MB) and for the new knowledge base (NKB)
content. The configuration can be defined for any of the following:

• Store/file e-mails sent and received cumulatively in the PED, which is
configurable. Scheduled deletion of mails is provided for.

• Automatically trim the PED in order to remove mails that are no longer
relevant. The invention accepts parameters that specify what kind of
mails/attachments need to be deleted from the PED so as to occupy optimal
disk space and increase the relevance and efficiency of searches made. This
trimming can be done on both the MB and the NKB.

• Enable/disable history search and intelligent reply template generation for
specific kinds of mails, e.g. short mails.

• Enable/disable history search and intelligent reply template generation for
specific types of mails, e.g. cc'ed type or bcc'ed type or mails sent to
newsgroups.

• Specify history search and intelligent reply template generation parameters 
like: 

• Whether to search on subject and/or sender 

• Time period in which the messages need to be searched for. 

• Type of message contents to be included/excluded 

• Scheduled deletion of mails from the MB and NKB 

• Scheduled sending of mails. 

• Specify latest first or oldest first while generating relevant intelligent reply 

• Specify limits on inclusion of older reply contents - time period wise, 
volume wise and bandwidth wise. 

• Specify search criteria inclusion/exclusion of keywords. 

• Provide for accessing multiple PED at various locations over the network. 

• Provide for on-the-fly exclusion/inclusion of original mail and reply contents 
including the various levels of replies and counter - replies. 
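
The automatic trimming option listed above can be illustrated with a short sketch. The record layout and the parameter names (max_age_days, kinds, drop_attachments) are assumptions; the specification only states that parameters specify which mails/attachments are to be deleted.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class StoredMail:
    mail_id: str
    received: datetime
    kind: str                    # e.g. "one-line", "short", "medium", "long"
    has_attachment: bool = False


def trim(ped, now, max_age_days, kinds=(), drop_attachments=False):
    """Split the PED into (kept, deleted) lists.

    A mail is deleted only if it is older than `max_age_days` AND either
    its kind is listed in `kinds` or it carries an attachment while
    `drop_attachments` is set.
    """
    cutoff = now - timedelta(days=max_age_days)
    kept, deleted = [], []
    for mail in ped:
        expired = mail.received < cutoff
        selected = mail.kind in kinds or (drop_attachments and mail.has_attachment)
        (deleted if expired and selected else kept).append(mail)
    return kept, deleted
```

Returning the deleted mails instead of discarding them silently lets a caller log or archive them before the space is reclaimed; the same function could be run over MB entries and NKB entries alike.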

Advantages of expert system based intelligent email system:

1. Time spent on mail reply and new mail composition is reduced
significantly. Based on the e-mail results discussed earlier, the reduction
is almost 50%, because the user reads the generated reply together with
the original mail at the same time.

2. The content of e-mail replies is made richer and relevant with details 
chosen from the PED containing past mails/messages. These details can 
be excluded by the user if they are found irrelevant or unnecessary. 

3. Selected types of mails, such as calendar plans, acknowledgements and
replies to meeting requests, can be sent automatically to certain
individuals without the user's intervention.

4. The invention uses parsing techniques and intelligently generates reply 
templates using historical data and assertions available in the PED. The 
volume and content of these reply templates and search mechanisms are 
fully configurable and can be chosen by the user to suit his/her e-mail 
profile and nature of e-mails. All of this leads to a general improvement 
in e-mail handling, storage, searching and reply generation. 

5. The most significant feature of this invention is knowledge creation in the 
form of assertions in the NKB, which is part of the PED. This shall lead 
to a general improvement in e-mail content quality. 

6. Conversion of MB contents into NKB assertions would pave the way to
convert unstructured and haphazard information into structured
knowledge that would be searchable and easily usable. The other
auxiliary benefit would be that there would be massive savings on disk
space occupied by e-mail mailboxes.

Claims:



1. In a computing system, a method to handle a large volume of e-mail
received from a plurality of senders intelligently, by automatically
processing each email based on a pre-determined classification system
and stored information, said method comprising the steps of:

- receiving and sending the electronic mails,

- parsing the electronic mail header to capture keywords for the
purpose of identifying the sender, the subject and specific key
words and/or phrases,

- parsing the electronic mail body including attachments if any, for
keywords and/or phrases for purpose of categorizing the e-mail for
response,

- storing the said received emails in a personalized email database
(PED),

- analyzing the emails stored in the PED for identifying co-relations
among received e-mails using an expert system (ES) with machine
learning capabilities to assist the user in analyzing and preparing
replies,

- preparing a reply template using a reply template generator (RTG),

- storing the email replies in said PED,

- configuring said PED and said ES using a personalized email
database configurator (PEC) for updation.



2. A method as claimed in claim 1 further comprising:

- storing of the received and sent e-mails in a mailbox (MB) within
said PED,

- storing the result of the analysis by said expert system in New
Knowledge Base (NKB) in the said PED.

3. A method as claimed in claim 1 further including storage of personal data
profile of the user, calendar of appointments / meetings, current job
contents in said PED.



4. A method as claimed in claim 1 further comprising the accessing of said 
PED over a network so as to make it useful to a travelling user. 

5. A method as claimed in claim 1 further comprising the accessing of said
PED through appropriate facilities including palm pilots.

6. A method as claimed in claim 1 further comprising:

- optionally generating the reply template,

- selecting mail type on which to generate reply template e.g. one-liner,
short, medium, long replies,

- enabling/disabling history search and intelligent reply template
generation for specific types of mails, e.g. short mails,

- enabling/disabling history search and intelligent reply template
generation for specific types of mails, e.g. cc'ed type or bcc'ed type
or mails sent to newsgroups,

- specifying history search and intelligent reply template generation
parameters like:

• whether to search on subject and/or sender,

• time period in which the messages need to be searched for,

• type of message contents to be included/excluded,

- scheduling deletion of mails from the MB and NKB,

- scheduling sending of mails,

- specifying latest first or oldest first while generating relevant
intelligent reply,

- specifying limits on inclusion of older reply contents - time period
wise, volume wise and bandwidth wise,

- specifying criteria for inclusion/exclusion of keywords,

- providing access to multiple PEDs at various locations over the
network,

- providing on-the-fly exclusion / inclusion of original mail and
reply contents including the various levels of replies and
counter-replies

by the user through said PEC.

7. A method as claimed in claim 1 further including displaying said reply 
template on the screen by said RTG based on searches conducted within 
the NKB in said PED. 

8. A method as claimed in claim 1 further including displaying of:

- the mail received R1,

- reply sent to R1 - S1,

- reply received on S1 - R2,

- reply sent to R2 - S2,

- reply received on S2 - R3 ....

by said RTG, serially and in chronological sequence, individually or in
groups of R1, R2, R3 or S1, S2, S3 or in any combination requested by
the user, either in configurable colors and/or with changed font type and
size.

9. A method as claimed in claim 1 further including viewing and searching
of the database by said RTG for relevant emails/messages with:

- the same subject,

- the same sender and same subject,

- the same subject and any one of the recipients listed in the cc: list
or the To: list and various other similar combinations.

10. A method as claimed in claim 1 wherein said reply template is in the
same format in which said attachments have been received.

11. A method as claimed in claim 1 further comprising generating
co-relations and new associations by said ES using state of the art and
state of the practice techniques of NLP, AI, machine learning.

12. A method as claimed in claim 1 further including searching the said PED
by said ES for co-relations amongst e-mails received

- sender wise,

- senders within a particular timeframe,

- thread wise or subject wise,

- sender and subject wise,

- sender, subject and date wise,

- sender, keyword wise.

13. In a computing system, a system to handle a large volume of e-mail
received from a plurality of senders intelligently, by automatically
processing each email based on a pre-determined classification system
and stored information, comprising:

- means for receiving and sending the electronic mails,

- means for parsing the electronic mail header to capture keywords
for the purpose of identifying the sender, the subject and specific
key words and/or phrases,






- means for parsing the electronic mail body including attachments if
any, for keywords and/or phrases for purpose of categorizing the
e-mail for response,

- means for storing the said received emails in a personalized email
database (PED),

- means for analyzing the emails stored in the PED for identifying
co-relations among received e-mails using an expert system (ES)
with machine learning capabilities to assist the user in analyzing
and preparing replies,

- means for preparing a reply template using a reply template
generator (RTG),

- means for storing the email replies in said PED,

- means for configuring said PED and said ES using a personalized
email database configurator (PEC) for updation.

14. A system as claimed in claim 13 further comprising:

- means for storing the received and sent e-mails in a mailbox (MB)
within said PED,

- means for storing the result of the analysis by said expert system in
New Knowledge Base (NKB) in the said PED.



15. A system as claimed in claim 13 further including means for storing 
personal data profile of the user, calendar of appointments / meetings, 
current job contents in said PED. 


16. A system as claimed in claim 13 further including the means for
accessing said PED over a network so as to make it useful to a travelling
user.

17. A system as claimed in claim 13 further including the means for
accessing said PED through appropriate facilities including palm pilots.

18. A system as claimed in claim 13 further comprising means for allowing
the user through said PEC to:

- optionally generate the reply template,

- select mail type on which to generate reply template e.g. one-liner,
short, medium, long replies,

- enable/disable history search and intelligent reply template
generation for specific types of mails, e.g. short mails,

- enable/disable history search and intelligent reply template
generation for specific types of mails, e.g. cc'ed type or bcc'ed type
or mails sent to newsgroups,

- specify history search and intelligent reply template generation
parameters like:

• whether to search on subject and/or sender,

• time period in which the messages need to be searched for,

• type of message contents to be included/excluded,

- schedule deletion of mails from the MB and NKB,

- schedule sending of mails,

- specify latest first or oldest first while generating relevant
intelligent reply,

- specify limits on inclusion of older reply contents - time period
wise, volume wise and bandwidth wise,

- specify criteria for inclusion/exclusion of keywords,

- provide access to multiple PEDs at various locations over the
network,

- provide on-the-fly exclusion / inclusion of original mail and reply
contents including the various levels of replies and counter-replies.

19. A system as claimed in claim 13 further including means for displaying
said reply template on the screen by said RTG based on searches
conducted within the NKB in said PED.



20. A system as claimed in claim 13 further comprising means for displaying:

- the mail received R1,

- reply sent to R1 - S1,

- reply received on S1 - R2,

- reply sent to R2 - S2,

- reply received on S2 - R3 ....

by said RTG, serially and in chronological sequence, individually or in
groups of R1, R2, R3 or S1, S2, S3 or in any combination requested by
the user, either in configurable colors and/or with changed font type and
size.

21. A system as claimed in claim 13 further including means for viewing and
searching of the database by said RTG for relevant emails/messages with:

- the same subject,

- the same sender and same subject,

- the same subject and any one of the recipients listed in the cc: list
or the To: list and various other similar combinations.

22. A system as claimed in claim 13 further including means for generating
co-relations and new associations by said ES using state of the art and
state of the practice techniques of NLP, AI, machine learning.

23. A system as claimed in claim 13 further including means for searching
said PED by said ES for co-relations amongst e-mails received

- sender wise,

- senders within a particular timeframe,

- thread wise or subject wise,

- sender and subject wise,

- sender, subject and date wise,

- sender, keyword wise.

24. A computer program product comprising computer readable program
code stored on computer readable storage medium embodied therein for
causing a computer to handle a large volume of e-mail received from a
plurality of senders intelligently, said computer program code
comprising:

- computer readable program code means configured for receiving
and sending the electronic mails,

- computer readable program code means configured for parsing the
electronic mail header to capture keywords for the purpose of
identifying the sender, the subject and specific key words and/or
phrases,

- computer readable program code means configured for parsing the
electronic mail body including attachments if any, for keywords
and/or phrases for purpose of categorizing the e-mail for response,

- computer readable program code means configured for storing the
said received emails in a personalized email database (PED),

- computer readable program code means configured for analyzing
the emails stored in the PED for identifying co-relations among
received e-mails using an expert system (ES) with machine
learning capabilities to assist the user in analyzing and preparing
replies,

- computer readable program code means configured for preparing a
reply template using a reply template generator (RTG),

- computer readable program code means configured for storing the
email replies in said PED,

- computer readable program code means for configuring said PED
and said ES using a personalized email database configurator
(PEC) for updation.

25. A computer program product as claimed in claim 24 further comprising:

- computer readable program code means configured for storing of
the received and sent e-mails in a mailbox (MB) within said PED,

- computer readable program code means configured for storing the
result of the analysis by said expert system in New Knowledge
Base (NKB) in the said PED.

26. A computer program product as claimed in claim 24 further including 
computer readable program code means configured for storage of 
personal data profile of the user, calendar of appointments / meetings, 
current job contents in said PED. 


27. A computer program product as claimed in claim 24 further including 
computer readable program code means configured for accessing said 
PED over a network so as to make it useful to a travelling user. 

28. A computer program product as claimed in claim 24 further comprising
computer readable program code means configured for accessing said
PED through appropriate facilities including palm pilots.






29. A computer program product as claimed in claim 24 further comprising
computer readable program code means configured for allowing the user
through said PEC to:

- optionally generate the reply template,

- select mail type on which to generate reply template e.g. one-liner,
short, medium, long replies,

- enable/disable history search and intelligent reply template
generation for specific types of mails, e.g. short mails,

- enable/disable history search and intelligent reply template
generation for specific types of mails, e.g. cc'ed type or bcc'ed type
or mails sent to newsgroups,

- specify history search and intelligent reply template generation
parameters like:

• whether to search on subject and/or sender,

• time period in which the messages need to be searched for,

• type of message contents to be included/excluded,

- schedule deletion of mails from the MB and NKB,

- schedule sending of mails,

- specify latest first or oldest first while generating relevant
intelligent reply,

- specify limits on inclusion of older reply contents - time period
wise, volume wise and bandwidth wise,

- specify criteria for inclusion/exclusion of keywords,

- provide access to multiple PEDs at various locations over the
network,

- provide on-the-fly exclusion / inclusion of original mail and reply
contents including the various levels of replies and counter-replies.

30. A computer program product as claimed in claim 24 further including
computer readable program code means configured for displaying said
reply template on the screen by said RTG based on searches conducted
within the NKB in said PED.

31. A computer program product as claimed in claim 24 further including
computer readable program code means configured for displaying of:

- the mail received R1,

- reply sent to R1 - S1,

- reply received on S1 - R2,

- reply sent to R2 - S2,

- reply received on S2 - R3 ....

by said RTG, serially and in chronological sequence, individually or in
groups of R1, R2, R3 or S1, S2, S3 or in any combination requested by the
user, either in configurable colors and/or with changed font type and size.

32. A computer program product as claimed in claim 24 further including
computer readable program code means configured for viewing and
searching of the database by said RTG for relevant emails/messages with:

- the same subject,

- the same sender and same subject,

- the same subject and any one of the recipients listed in the cc: list
or the To: list and various other similar combinations.

33. A computer program product as claimed in claim 24 further including
computer readable program code means configured for generating
co-relations and new associations by said ES using state of the art and
state of the practice techniques of NLP, AI, machine learning.

34. A computer program product as claimed in claim 24 further including
computer readable program code means configured for searching the said
PED by said ES for co-relations amongst e-mails received

- sender wise,

- senders within a particular timeframe,

- thread wise or subject wise,

- sender and subject wise,

- sender, subject and date wise,

- sender, keyword wise.

ABSTRACT



This invention relates to a method, system and computer program product for 
intelligently handling a large volume of emails received from a plurality of 
senders by automatically parsing the email header and body to capture specified 
keywords and preparing a reply template using an expert system to analyze the 
received emails that are stored in a personalized email database.